CDS

Accession Number TCMCG034C38472
gbkey CDS
Protein Id XP_028952769.1
Location join(20135990..20136346,20136954..20137180,20137597..20137661,20138617..20138691,20139142..20139488,20146716..20146784,20146865..20146953,20147826..20150030,20150155..20150232,20150363..20150471,20151040..20151090,20151212..20151262)
Gene LOC114822535
GeneID 114822535
Organism Malus domestica

Protein

Length 1240aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA534520
db_source XM_029096936.1
Definition dentin sialophosphoprotein-like [Malus domestica]

EGGNOG-MAPPER Annotation

COG_category S
Description Occludin homology domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
KEGG_ko ko:K11807        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTACTCATCCAAGTCCGGCCGAGTTGGCGGCGGTGGCTCAGGACGCGGAGTCGGAGCCGCCAAGCTTGGCCGAGCCTCCTTCCCTCCGCCCCTCCGCTCCTCAGCCCAAACCAGCCGCCTCTCCCTCGGAGGCTCCAATCCCAGAGGCCGCAACTCAGGCTCCAATACGTCAGCAGCCGCCGCCCCGCCGGCCGTGGAAGAGCAGTTCAGTTTAGTCGCCGGGCGTAACCCACTTGCTTTCTCTGTGATCATTCGTATGGCGCCGGACTTGGTGGAAGAGATCAAGCGCGTCGAAGACCAAGGTGGTGCGCCCCGCATCAAGTTCGGTCCGTCGCCTAACAACTCCCTCGGCAATGTTATTGATCTGGGTGGGAAGGAGTTTACATTTACATGGGACCAGGAAGTTGGTGACCTCTGTGACATTTATGAAGAGCGTGAAAGTGGTGAAGGTGGAAATGGTTTGCTAGTTGAATCAGGATTTGCATGGCGCAAAGTTAATGTGCATCGGATCTTAGATGAGTCAACTAAGAACCGTGTTAAACAGCGGTCAGAGGAAGCTGAGCGCAAGAATAAATCCCGCAAAGCCATTGTGTTAGAACAATGTACAGAGACGAAGAATCAGTTAAAGCAATATGATACTACAACAGCTAATCAACCTTGGAGGAATTTCAAACGAAAGAAAGAGACTCCCTTTAAGAAGCAGAAAGTTGAACTGCCTCAAGCTCCCCCTAAATCTACAGTTAAAACTGGAGTGTCATCAACAACTACTGCAAAGGGTAAGTACTCATCATCACCTCTTCAAACTCTACCTGAGCAATCTGGTGCTCTACCATCTCCGTTAAGAACTAGAAATAATTTGATGAGTCATGCAAGTGTGGAAGATATTATACCCTTTCAAGTGATTAAAAAAGACAAACCTGCTCCAAGCTCTGATAAAGAAATCCAAATCACGACCAGTATTGTACCACAAGAAACAGTGGGCAAAGGCAATGGTGGAGCTAATGTTACAGATTTGCAGATTATGTTGATTGCTTTGCTCACGGAGAATCCCAAAGGCATGAGAATAAAGGATTTAGAGAAAGCTATTGCAGATTCATATCCCAACTCTATAAAGAAAATTGAGCCCATTATTAAGAAGATTGCTAATTATCAAGCTCCAGGGAGATACATCTTGAAATCAGGAGTGGAGTTAGAAAGCTTAAAGAAACCTGTGTCTGAAAGTGGAAGTTCTCCCGAGGAGAAACTTCGCCAGAAACCTGCTTCTGATGGTAACCGTGAATTGCCTGTTCAAGCACCACACTTTGAGGAGAAAGTTTCTCCTAATGAGTTGGAGGAGCAGGTTCACGAACGATTGGATGCTACAATTGGTCAGGATTCAAGTGCATTGATTAAAATTGATAACCAACAGCATTCCGCTGATTTTTTTGGTGAAAAAAGGGGCTCAGACGATAATGAAGCGCGAGGAGGAAGCTCTAGCAATACTGGGAGTGCCAGTGACAGTGATAGTGACAGCAGCGATACCGGAAGTGACAGTGGAAGTCATAGCAGAAGTAAAAGTCCAGCACAAAATGGGAGTGGAAGCAGTAGTGATAGTGAAAGTGATGCATCTTCTAATAGCAAGGAAGGTTTTGATGAGGATGTGGATATTATGGCAAGTGACGATGACAAAGAACTGAAGCTTAAATTGCAAGGCTCTGAACAAGGGTTTTCTACTTTGCCCATCCCATGGAGAACTCCTGATGGCACGAATGTGCAGAGTGGGATTGATGAGAAGCAAGATGATCTTGAAGTTGACGCAGTTGAGATTGAAAAACAGTTGCTTGATGCTGATCAGGTTGCTGAATTGGCTGTAGTTAGTAACTCTAATCGTAATAAAAGTGAAACTCTAGAAGAAACAAAACCCTTTTCACCTGAGCATGGTGAGCTGCAAGGCCGCCAAAGTTTGGTAGACCCTGTGTATGGAGGGGAGAGTTTAGCCAAGGATGACTTCAGATATGAGCAGTCTGACAGTTCTGAGAGGATATCTAAAGGTATATCCAAACAAGGTTCTGAGGTAAAGCACACTGATGAGAAATCTAAAAACAATAAAAGATTAAAAACAGAAATATCACAGCAACCTTCTATTTCTGCGGGTAAAGGTGTCCATCTTTCAGAGGCTTCTCACAATTTGTCTCCTGATAGACTTGTTGAAGGCTCTCATAAGGACCCTGTCTCTCAAATGATGGATAGAGATGATAGGGATGAGAATATTGAATCTGGATCACTAAAGGTGTGCAACCAAGCATTTTCTGGCAGATATACTTCAGAGTTTAAACAATCAAGTTGGCAGTCTTTTGATCATAATTTACAGACAAAGGTTCCTGATTCAGCAGAAAGACCAGATAGATATGCGGAGAGCTTAGGGCATGGCGGGAAGTACTCTGAAAAGAGCTCTCAAATGCATGACGGTTACCTCTTACAAAAAGATAAATTTCATAGAGATACTCAAAATGAGGATGGTTATGCTAATGAGAGAAAGGTTCCAGGAAATTCTAAGGAAGGTGGTACCCGAGGCAAACAGTTAGTGCCCTTGGATCCCCACTACCAGAAAGATGGTGATATGGCTGGAAAGATCAAGGGAAGTGCACAAGTTTCCAGCCTGTTATTGGGTTCTTCACCAAAGTATAACAGCAGAAATGGTGCAGCCGCATCCCCTGTCGTTAATGGCAAAGGCAGTAAACTCCAAAGAGAGTTTTCAGACTTGGAGTTGGGCGAACTTCGTGAGCCCTTGCCTGAGGAAATGACAGTTAATAAGCAATTTGAGAGAAAAAGTTCTTTTAAGCAGTCAGATAACAAAAAGAGCACTTCAGAGAACCAGGTTTCTGAATTCAGTAAAGTGAAACGTGCTGGAAAGGCAAATTTTGATTCAGGAAGGCCAGCCTCTCCAGATTTAAACTCTAAGTTTCCAAGTAATCAGGAAGGCTTGAATAAAAAGAGGAACTACGAAGATCGCATTGAGGATTTAACAAGGTCTCAACAGAGAGCTGTGCAGTCTGAGTCGCAACACCCGTCACGAGTAGATCATCCTGATTTGGGGCGTTTGTTTAGCAAAACAGTCGGTCTTAGTAGTAAATCTAGACAGAATGAAGTTGGAGGCAGACAAGCAATTGGTGTGGCTGGACATGGAGAAAGCAATAAGAAAGCAACCCCAAGTGCTCCTCAGCAGCATAACTCAAAACGAGGGCTAGTTTCCCACCCCATACAAGAAAGTAAAAGACATGCATCCAATATAATGGTGGATTCGACTGATGTACGAAAGAAATCAATGGTGGCAGACGGCAATGACACTGACAGAAAAAAGAGGGATTCTTCTTCAGATGAGAATAGTTGTTCCTATTCTAAGTATGAGAAGAATGAGCCCGAGCGCAAGGGACCAATAAATAATTTCTCTCAGTACGAAGAGTATGTGCAAGAATATCGTGATAAGTATGATAGCTATTGCTCGTTGAACAAAACCATAGAGAGTTACAGGAATGAGTTTGAAAAGCTAGGAAAGGACCTTGATTATGCTACTCAAGGCACGGAGAAATATTATAAGATCTTGGGGCAGGTGAAGGAATCATATCGTCGATGTGAAAGGAAACATACGAGGCTGAAAAGAATATTTGTGGTACTTCATGAAGAATTGAAGCACATTAAGCAACGGATTAAGGACTTTGCAGTCTCATACATGGAGGACTGA
Protein:  
MYSSKSGRVGGGGSGRGVGAAKLGRASFPPPLRSSAQTSRLSLGGSNPRGRNSGSNTSAAAAPPAVEEQFSLVAGRNPLAFSVIIRMAPDLVEEIKRVEDQGGAPRIKFGPSPNNSLGNVIDLGGKEFTFTWDQEVGDLCDIYEERESGEGGNGLLVESGFAWRKVNVHRILDESTKNRVKQRSEEAERKNKSRKAIVLEQCTETKNQLKQYDTTTANQPWRNFKRKKETPFKKQKVELPQAPPKSTVKTGVSSTTTAKGKYSSSPLQTLPEQSGALPSPLRTRNNLMSHASVEDIIPFQVIKKDKPAPSSDKEIQITTSIVPQETVGKGNGGANVTDLQIMLIALLTENPKGMRIKDLEKAIADSYPNSIKKIEPIIKKIANYQAPGRYILKSGVELESLKKPVSESGSSPEEKLRQKPASDGNRELPVQAPHFEEKVSPNELEEQVHERLDATIGQDSSALIKIDNQQHSADFFGEKRGSDDNEARGGSSSNTGSASDSDSDSSDTGSDSGSHSRSKSPAQNGSGSSSDSESDASSNSKEGFDEDVDIMASDDDKELKLKLQGSEQGFSTLPIPWRTPDGTNVQSGIDEKQDDLEVDAVEIEKQLLDADQVAELAVVSNSNRNKSETLEETKPFSPEHGELQGRQSLVDPVYGGESLAKDDFRYEQSDSSERISKGISKQGSEVKHTDEKSKNNKRLKTEISQQPSISAGKGVHLSEASHNLSPDRLVEGSHKDPVSQMMDRDDRDENIESGSLKVCNQAFSGRYTSEFKQSSWQSFDHNLQTKVPDSAERPDRYAESLGHGGKYSEKSSQMHDGYLLQKDKFHRDTQNEDGYANERKVPGNSKEGGTRGKQLVPLDPHYQKDGDMAGKIKGSAQVSSLLLGSSPKYNSRNGAAASPVVNGKGSKLQREFSDLELGELREPLPEEMTVNKQFERKSSFKQSDNKKSTSENQVSEFSKVKRAGKANFDSGRPASPDLNSKFPSNQEGLNKKRNYEDRIEDLTRSQQRAVQSESQHPSRVDHPDLGRLFSKTVGLSSKSRQNEVGGRQAIGVAGHGESNKKATPSAPQQHNSKRGLVSHPIQESKRHASNIMVDSTDVRKKSMVADGNDTDRKKRDSSSDENSCSYSKYEKNEPERKGPINNFSQYEEYVQEYRDKYDSYCSLNKTIESYRNEFEKLGKDLDYATQGTEKYYKILGQVKESYRRCERKHTRLKRIFVVLHEELKHIKQRIKDFAVSYMED